Modifications of the Burrows and Wheeler Data Compression Algorithm

نویسندگان

  • Bernhard Balkenhol
  • Stefan Kurtz
  • Yuri M. Shtarkov
چکیده

In 1994 Burrows and Wheeler [3] described a universal data compression algorithm (BWalgorithm, for short) which achieved compression rates that were close to the best known compression rates. Due to it’s simplicity, the algorithm can be implemented with relatively low complexity. Fenwick [5] described ideas to improve the efficiency (i.e. the compression rate) and complexity of the BW-algorithm. He also discusses relationships of the algorithm with other compression methods. Schindler [12] proposed a Burrows and Wheeler Transformation (BWT, for short) that is based on a limited ordering. This speeds up the algorithm for compression, but slows it down for decompression and slightly decreases the efficiency. Larsson [8] describes relationship of the BWT with suffix trees and with context trees. Sadakane [11] suggests a method to compute the BWT faster, and compares it to other methods. Recently Balkenhol and Kurtz [1] gave a thorough analysis of the BWT from an information theoretic point of view. They described implementation techniques for data compression algorithms based on the BWT, and developed a program with a better compression rate. In this paper we improve upon these previous results on the BW-algorithm. Based on the context tree model, we consider the specific statistical properties of the data at the output of the BWT. We describe six important properties, three of which have not been described elsewhere. These considerations lead to modifications of the coding method, which in turn improve the coding efficiency. We shortly describe how to compute the BWT with low complexity in time and space, using suffix trees in two different representations. Finally, we present experimental results about the compression rate and running time of our method, and compare these results to previous achievements. More references on the methods described in this paper can be found in [1, 5].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Burrows-Wheeler compression with modified sort orders and exceptions to the MTF phase, and their impact on the compression rate

This thesis examines two modifications of the Burrows-Wheeler compression algorithm. The effect of non-standard sort orders during the sorting phase of the Burrows-Wheeler transform is analyzed and a new method to find good sort orders developed and compared to previous work. The process is generalized to allow multiple sort orders in the transform and the reversibility in some cases is shown. ...

متن کامل

The Burrows-Wheeler Algorithm

The Burrows-Wheeler Algorithm was published in the year 1994 by Michael Burrows and David Wheeler in the research report “A Block-sorting Lossless Data Compression Algorithm”. This research report is based on an unpublished work by David Wheeler from the year 1983. The Burrows-Wheeler Algorithm will used for data compression. The algorithm consists of several stages and these stages are perform...

متن کامل

Improvements to the Burrows-Wheeler Compression Algorithm: After BWT Stages

The lossless Burrows-Wheeler Compression Algorithm has received considerable attention over recent years for both its simplicity and effectiveness. It is based on a permutation of the input sequence − the Burrows-Wheeler Transform − which groups symbols with a similar context close together. In the original version, this permutation was followed by a Move-To-Front transformation and a final ent...

متن کامل

Lossless Compression of Ecg Signals

In this paper we study the compression techniques for electrocardiogram (ECG) signals based on Block Sorting Techniques. We introduce a new and faster block transformation than the Burrows and Wheeler Transformation (BWT), and later compare them for ECG data compression. We show that our algorithm yields better compression gain than the Burrows and Wheeler’s algorithm (BWA), Gzip and the Shorte...

متن کامل

One attempt of a compression algorithm using the BWT

In 1994 Burrows and Wheeler [5] described a universal data compression algorithm (BW-algorithm, for short) which achieved compression rates that were close to the best known compression rates. Due to it’s simplicity, the algorithm can be implemented with relatively low complexity. Fenwick [8] described ideas to improve the efficiency (i.e. the compression rate) and complexity of the BW-algorith...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999